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CAROTENOID KETOLASE GENES AND GENE PRODUCTS, 
PRODUCTION OF KETOCAROTENOIDS AND METHODS OF 
MODIFYING CAROTENOIDS USING THE GENES 



BACKGROUND OF THE INV ENTION 
5 Carotenoids are widely distributed natural pigments that are responsible for 

many of the yellow, orange and red colors seen in living organisms. They have 
important commercial uses as coloring agents in the food industry, as feed and food 
additives, in cosmetics and as provitamin A precursors. 

The plant species Adonis aestivalis produces flowers with petals that are deep 
10 red in color and nearly black at the base of the petals due to the accumulation of 
ketocarotenoid and other carotenoid pigments (Neamtu et al M Rev. Roum. Biochim. 
6:157, 1969). This pattern of carotenoid accumulation accounts for the common name 
of some varieties of this species: summer pheasant's eye. 

Among the carotenoids identified in the petals of the red petal varieties of these 
15 various species is the ketocarotenoid astaxanthin (S^-dihydroxy-A.^-diketo-b.b- 
carotene; see Figure 1). Various other ketocarotenoids (see Figure 1) including 3- 
hydroxyechinenone (3-hydroxy-4-keto-b,b-carotene), adonirubin (3-hydroxy-4,4'-diketo- 
b,b-carotene) adonixanthin (3,3'-dihydroxy-4-keto-b,b-carotene) and isozeaxanthin 
(4,4'-dihydroxy-b,b-carotene; see T.W. Goodwin, The Biochemistry of the Carotenoids, 
20 vol I. Plants, 2nd edition, 1980, page 147) have also been reported. The latter 
compound is consistent with speculation that the 4-hydroxy may be an intermediate in 
the formation of the 4-keto group. 

SUMMARY OF THE INVENTION 
There is appreciable interest in the biological production of carotenoids, in 

25 particular the orange-colored ketocarotenoids such as astaxanthin and canthaxanthin 
(Figure 1), and in the modification of carotenoid composition. For this reason, an A. 
aestivalis flower cDNA library was constructed and screened for cDNAs encoding 
enzymes (hereinafter referred to as "ketolases" although the specific biochemical 
activity has not yet been established) involved in the conversion of b-carotene into 

30 orange compounds with absorption properties similar to those exhibited by common 
ketocarotenoids such as canthaxanthin (Figure 1). Two distinctly different Adonis 
aestivalis cDNAs were obtained from among a number of cDNAs that were selected on 
this basis. 
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Thus, a first aspect of the present invention is a purified nucleic acid sequence 
which encodes for a protein having ketolase enzyme activity and has the nucleic acid 
sequence of SEQ ID NO: 1 or 3. 

The invention also includes a purified nucleic acid sequence which encodes for 
5 a protein having ketolase enzyme activity and having the amino acid sequence of SEQ 
ID NO: 2 or 4. 

The invention also includes vectors which comprise any portion of the nucleic 
acid sequences listed above, and host cells transformed with such vectors. 

Another aspect of the present invention is a method of producing a 
10 ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and comprises 
(1) SEQ ID NO: 1 or 3 or (2) a sequence which encodes the amino acid sequence of 
SEQ ID NO: 2 or 4, wherein the heterologous nucleic acid sequence is operably linked 
15 to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 
the ketolase enzyme. 

Another subject of the present invention is a method of modifying the production 
of carotenoids in a host cell, relative to an untransformed host cell, the method 
20 comprising 

inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
ketolase enzyme activity and comprises (1 ) SEQ ID NO: 1 or 3 or (2) a sequence which 
encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the heterologous 
25 nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence in the host cell to 
modify the production of the carotenoids in the host cell, relative to an untransformed 
host cell. 

BRIEF DESCRIP TI ON QF TH E DRAWINGS 
30 A more complete appreciation of the invention and many of the attendant 

advantages thereof will be readily obtained as the same becomes better understood by 
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reference to the following detailed description when considered in connection with the 
accompanying drawings. 

Figure 1 illustrates structures and biochemical routes leading from b-carotene to various 
of the ketocarotenoids referred to in the text. Conversion of p-carotene to astaxanthin 
5 by a hydroxylase enzyme (Hy) and a ketolase enzyme (keto) could proceed via any one 
or all of several possible routes depending on the order of the reactions. 

Figure 2 illustrates the beta ring structure of b-carotene and various modifications of this 
parent ring that might be produced through the action of the products of the A. aestivalis 
ketolase cDNAs. Also shown is the structure of the epsilon ring, not found to be a 
10 substrate for the A aestivalis ketolases and present in carotenoids such as d-carotene, 
e-carotene, a-carotene and lutein. 

Figure 3 illustrate results obtained with TLC (thin layer chromatography) separation of 
carotenoid pigments extracted from E. coli cultures, previously engineered to produce 
b-carotene, but that now also contain the A. aestivalis ketolase cDNAs and/or other 

15 introduced genes and cDNAs. The Figure indicates the empty plasmid vector 
pBluescript SK- (SK-), the Adonis aestivalis ketolase 1 cDNA in this plasmid vector (Ad 
ketol), the Haematococcus pluvialis ketolase cDNA in this plasmid vector Hp keto), or 
the Arabidopsis p-carotene hydroxylase cDNA (At Ohase). Bands that were orange in 
color are shown here with a darker fill than those with a yellow color. Identities of 

20 various bands are indicated to the right of the band. 

Figure 4 illustrates the absorption spectrum of one of the orange carotenoids produced 
from b-carotene via the action of the Adonis ketolases and makes clear the similarity 
of the spectrum to that of canthaxanthin. Absorption spectra (in acetone) of p-carotene, 
canthaxanthin and an unknown orange product (orange band #1; the lower orange 
25 band in the first lane of Figure 3) extracted from cultures after introduction of the Adonis 
aestivalis ketol cDNA (SEQ ID NO: 1) in cells of E. coli that otherwise produce and 
accumulate p-carotene. The absorption spectrum of the unknown resembles that of 
canthaxanthin but the compound migrates to a position below echinenone on RP16 
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TLC plates developed with a mobile phase of methanolracetone (1:1 by volume). The 
absorption spectrum of orange band #2 also is similar to that of canthaxanthin but it 
migrates more rapidly than canthaxanthin indicating that it is probably a more polar 
compound. 

5 Figure 5 shows SEQ ID NO: 5 (the sequence shown in this Figure includes SEQ ID NO: 

1 and also includes some of the flanking DNA from the adapator DNA and the multiple 
cloning site (MCS) of the library cloning vector, which sequences are shown in bold). 

Figure 6 shows SEQ ID NO: 6 (the sequence shown in this Figure includes SEQ ID NO: 

2 and also includes a translation of amino acids resulting from the adapator DNA and 
1 0 the multiple cloning site (MCS) of the library cloning vector and the start codon from the 

plasmid vector pTrcHis, which sequences are shown in bold and capitalized). 

Figure 7 shows SEQ ID NO: 7 (the sequence shown in this Figure includes SEQ ID NO: 

3 and also includes some of the flanking DNA from the adapator DNA and the multiple 
cloning site (MCS) of the library cloning vector, which sequences are shown in bold). 

1 5 Figure 8 shows SEQ ID NO: 8 (the sequence shown in this Figure includes SEQ ID NO: 

4 and also includes a translation of amino acids resulting from the adapator DNA and 
the multiple cloning site (MCS) of the library cloning vector and the start codon from the 
plasmid vector, which sequences are shown in bold and capitalized). 

Figure 9 shows a "Gap" alignment of the two Adonis ketolase sequences of the 
20 invention. A truncated version of SEQ ID NO: 1 is shown in this Figure for comparitive 
purposes, and is designated SEQ ID NO: 9. The percentage identity was calculated 
to be 91,107. 

Figure 10 shows a "Gap" alignment of SEQ ID NO: 2 and 4. The following results were 
found: 

25 Gap weight: 12 average match: 2.912 

Length weight: 4 average mismatch: -2.003 
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Quality: 1440 length: 307 

Ratio: 4.691 gaps: 0 

percent similarity: 92. 1 82 percent identity: 90.228 

Figure 1 1 shows a comparison between SEQ ID NO: 2 and the Arabidopsis thaliana (3- 
5 carotene hydroxylase enzyme (GenBank U58919) (SEQ ID NO: 10). 

Figure 12A shows gDNA (SEQ ID NO: 11) immediately upstream of the cDNA of SEQ 
ID NO: 3. The sequence was obtained from a PGR product generated using the 
GenomeWalker kit of Clontech Laboratories, Inc. (1020 East Meadow Circle, Palo Alto, 
CA 94303-4230) and nested primers specific to the ketolases of Adonis aestivalis 
10 (cagaatcggtctgttctattagttcttcc (SEQ ID NO: 17) and caatttgaggaatatcaaggttccttgttctc 
(SEQ ID NO: 18)). The termination codon upstream of and in-frame with initiation 
codon (TAA at positions 204-206) is shown in bold. Initiation codon (ATG) is also 
shown in bold. 

Figure 12B (SEQ ID NO: 12) indicates that the full length polypeptide of SEQ ID NO: 
15 4 begins with the amino acids MAA (shown in bold) immediately preceding the ketolase 
sequence shown in Figure 8. A similar MAA amino acid sequence immediately 
preceding SEQ ID NO: 1 is also expected. 

Figure 13 shows an alignment of SEQ ID NO: 2, SEQ ID NO: 12, an Arabidopsis (3- 
20 carotene hydroxylase enzyme (predicted product of GenBank U58919) (SEQ ID NO: 
13), a putative second Arabidopsis hydroxylase predicted by genomic DNA sequence 
(GenBank AB025606; the exon/intron junctions were chosen with reference to the 
product of the Arabidopsis 0-carotene hydroxylase cDNA u58919) (SEQ ID NO: 14), 
and two Capsicum annuum (3-carotene hydroxylases (predicted products of GenBank 
25 Y09722 and Y09225) (SEQ ID NO: 15 and 16). 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present invention is directed to a purified nucleic acid sequence which 
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encodes for a protein having ketolase enzyme activity and has the nucleic acid 
sequence of SEQ ID NO: 1 or 3. 

The invention also includes a purified nucleic acid sequence which encodes for 
a protein having ketolase enzyme activity and having the amino acid sequence of SEQ 
5 ID NO: 2 or 4. 

Two different but closely-related nucleic acids have been isolated. The 
sequences of the longest example of each are presented herein. Sequencing which 
has subsequently been conducted of upstream genomic DNA indicates that SEQ ID 
NO: 3 lacks bases encoding the first three amino acids (MAA; see Figure 12). Likely, 

10 this is also the case for SEQ ID NO: 1 , but the upstream genomic sequences have not 
yet been obtained for this nucleic acid. 

The two different Adonis ketolases denoted in SEQ ID NO: 1 and 3 are similar 
in sequence, sharing about 91% identity, as determined by the Gap program discussed 
below (see Figure 9), The predicted amino acid sequences of the enzymes denoted in 

15 SEQ ID NO: 2 and 4 share about 92% similarity and about 90% identity, also as 
determined by the Gap program (see Figure 10). 

Therefore, it is clear that certain modifications of SEQ ID NO: 1 or 3 or SEQ ID 
NO: 2 or 4 can take place without destroying the activity of the enzyme. Note also that 
certain truncated versions of the cDNAs of SEQ ID NO: 1 or 3 were found to be 

20 functional (i.e., these cDNAs retained the property of causing the conversion of b- 
carotene to orange compounds). Also, the Arabidopsis p-carotene hydroxylase 
(GenBank U58919), aligned with the ketolase SEQ ID NO: 2 in Figure 11, retains 
catalytic function when truncated to yield a polypeptide that lacks the first 129 amino 
acids (Sun et al M 1996). From the alignment in Figure 1 1 , therefore, this would suggest 

25 that the two ketolases of the invention retain catalytic activity after truncation to remove 
bases encoding the first 132 amino acids. 

Thus, the present invention is intended to include those ketolase nucleic acid 
and amino acid sequences in which substitutions, deletions, additions or other 
modifications have taken place, as compared to SEQ ID NO: 1 or 3 or SEQ ID NO: 2 

30 or 4, without destroying the activity of the ketolase enzyme. Preferably, the 
substitutions, deletions, additions or other modifications take place at those positions 
which already show dissimilarity between the present sequences. For SEQ ID NO: 1 , 
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as shown in Figure 9, these positions are as follows: positions 7, 20, 23, 35, 53, 63, 65, 
67, 76, 78, 85,86,91, 107, 109-111, 135, 140, 144, 146, 160, 168,217,219,241,249, 
254, 256, 271 , 291 , 296, 349, 389, 400, 406, 431 , 448, 449, 460, 471 , 499, 530, 589, 
619, 643, 653, 654, 667, 679, 709, 731, 742, 784, 787, 836, 871, 883, 896, 911, 919, 
5 928, 930, 939, 943, 967, 969, 978, 979, 982, 988, 995, 1005, 1006, 1012-1014, 1017, 
1019-1021, 1023, 1025, 1049, 1050, 1054, 1060-1068, 1070-1073, 1075, 1094, 1100, 
1101, 1106, 1107, 1109 and 1111-1176. For SEQ ID NO: 3, as shown in Figure 9, 
these positions are as follows: positions 7, 20, 23, 35, 53, 63, 65, 67, 76, 78, 85, 86, 91 , 
107, 109-111, 135, 140, 144, 146, 160, 168,217,219, 241,249,254,256,271,291, 
10 296, 349, 389, 400, 406, 431 , 448, 449, 460, 471, 499, 530, 589, 619, 643, 653, 654, 
667, 679, 709, 731, 742, 784, 787, 836, 871, 883, 896, 911, 919, 928, 930, 939, 943, 
966, 967, 970, 979, 980, 983, 989, 996, 1006, 1007, 1013-1015, 1018, 1020-1022, 
1024, 1026, 1050, 1051, 1055, 1062-1065, 1067, 1086, 1092, 1093, 1098, 1099, 1101 
and 1103-1112. 

15 For SEQ ID NO: 2 and 4, as shown in Figure 10, the following amino acids can 

be substituted or deleted, or additions or other modifications can be made, without 
destroying the activity of the ketolase enzyme: positions 7, 8, 12, 18, 21, 22, 25, 26, 36, 
37, 45, 47-49, 56, 73, 83, 85, 97, 99, 130, 144, 150, 157, 166, 218, 244, 279, 299 and 
304. Therefore, the present invention also intends to cover amino acid sequences 

20 where such changes have been made. 

In each case, nucleic acid and amino acid sequence similarity and identity is 
measured using sequence analysis software, for example, the Sequence Analysis, Gap, 
or BestFit software packages of the Genetics Computer Group (University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, Wisconsin 53705), MEGAIign 

25 (DNAStar, Inc., 1228 S. Park St., Madison, Wisconsin 53715), or MacVector (Oxford 
Molecular Group, 2105 S. Bascom Avenue, Suite 200, Campbell, California 95008). 
Such software uses algorithms to match similar sequences by assigning degrees of 
identity to various substitutions, deletions, and other modifications, and includes 
detailed instructions as to useful parameters, etc., such that those of routine skill in the 

30 art can easily compare sequence similarities and identities. An example of a useful 
algorithm in this regard is the algorithm of Needleman and Wunsch, which is used in the 
Gap program discussed above. This program finds the alignment of two complete 
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sequences that maximizes the number of matches and minimizes the number of gaps. 
Another useful algorithm is the algorithm of Smith and Waterman, which is used in the 
BestFit program discussed above. This program creates an optimal alignment of the 
best segment of similarity between two sequences. Optimal alignments are found by 
5 inserting gaps to maximize the number of matches using the local homology algorithm 
of Smith and Waterman. 

Conservative (i.e. similar) substitutions typically include substitutions within the 
following groups: glycine and alanine; valine, isoleucine and leucine; aspartic acid, 
glutamic acid, asparagine and glutamine; serine and threonine; lysine and arginine; and 

1 0 phenylalanine and tyrosine. Substitutions may also be made on the basis of conserved 
hydrophobicity or hydrophilicity (see Kyte and Doolittle, J. MoL Biol. 157: 105-132 
(1982)), or on the basis of the ability to assume similar polypeptide secondary structure 
(see Chou and Fasman, Adv. Enzymoi 47: 45-148 (1978)), 

If comparison is made between nucleotide sequences, preferably the length of 

15 comparison sequences is at least 50 nucleotides, more preferably at least 60 
nucleotides, at least 75 nucleotides or at least 100 nucleotides. It is most preferred if 
comparison is made between the nucleic acid sequences encoding the enzyme coding 
regions necessary for enzyme activity. If comparison is made between amino acid 
sequences, preferably the length of comparison is at least 20 amino acids, more 

20 preferably at least 30 amino acids, at least 40 amino acids or at least 50 amino acids. 
It is most preferred if comparison is made between the amino acid sequences in the 
enzyme coding regions necessary for enzyme activity. 

While the two different Adonis ketolase enzymes of the present invention are 
similar in sequence, previously-described bacterial (Misawa et al., 1995), cyanobacterial 

25 (Fernandez-Gonzalez et ah, 1997), and green algal (Haematococcus pluvialis; Lotan et 
al., 1995; Kajiwara et al., 1995) P-carotene ketolase enzymes bear little resemblance 
to the Adonis ketolases, although certain histidine motifs and features of the predicted 
secondary structure are common to the polypeptides predicted by both groups 
(Cunningham and Gantt, 1998). 

30 The present invention also includes vectors containing the nucleic acids of the 

invention. Suitable vectors according to the present invention comprise a gene 
encoding a ketolase enzyme as described above, wherein the gene is operably linked 
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to a suitable promoter. Suitable promoters for the vector can be constructed using 
techniques well known in the art (see, for example, Sambrook et al., Molecular Cloning 
A Laboratory Manual , Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989; 
Ausubel et ah, Current Protocols in Molecular Biology r Greene Publishing and Wiley 
5 Interscience, New York, 1 991 ). Suitable vectors for eukaryotic expression in plants are 
described in Fray et al., (1995; Plant J. 8:693-701) and Misawa et al, (1994; Plant J. 
6:481-489). Suitable vectors for prokaryotic expression include pACYC184, pUC1 19, 
and pBR322 (available from New England BioLabs, Bevery, MA) and pTrcHis 
(Invitrogen) and pET28 (Novagen) and derivatives thereof. The vectors of the present 

10 invention can additionally contain regulatory elements such as promoters, repressors, 
selectable markers such as antibiotic resistance genes, etc., the construction of which 
is very well known in the art. 

The genes encoding the ketolase enzymes as described above, when cloned 
into a suitable expression vector, can be used to overexpress these enzymes in a host 

1 5 cell expression system or to inhibit the expression of these enzymes. For example, a 
vector containing a gene of the invention may be used to increase the amount of 
ketocarotenoids in an organism and thereby alter the nutritional or commercial value or 
pharmacology of the organism. A vector containing a gene of the invention may also 
be used to modify the carotenoid production in an organism. 

20 Therefore, the present invention includes a method of producing a 

ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and comprises 
(1) SEQ ID NO: 1 or 3 or (2) a sequence which encodes the amino acid sequence of 

25 SEQ ID NO: 2 or 4, wherein the heterologous nucleic acid sequence is operably linked 
to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 
the ketocarotenoid. 

The present invention also includes a method of modifying the production of 
30 carotenoids in a host cell, relative to an untransformed host cell, the method comprising 
inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
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ketolase enzyme activity and comprises (1 ) SEQ ID NO: 1 or 3 or (2) a sequence which 
encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence in the host cell to 
5 modify the production of the carotenoids in the host cell, relative to an untransformed 
host cell. 

The term "modifying the production" means that the amount of carotenoids 
produced can be enhanced, reduced, or left the same, as compared to an 
untransformed host cell. In accordance with one embodiment of the present invention, 

10 the make-up of the carotenoids (i.e., the type of carotenoids produced) is changed vis 
a vis each other, and this change in make-up may result in either a net gain, net loss, 
or no net change in the amount of carotenoids produced in the cell. In accordance with 
another embodiment of the present invention, the production or the biochemical activity 
of the carotenoids (or the enzymes which catalyze their formation) is enhanced by the 

15 insertion of the ketolase enzyme-encoding nucleic acid. In yet another embodiment of 
the invention, the production or the biochemical activity of the carotenoids (or the 
enzymes which catalyze their formation) may be reduced or inhibited by a number of 
different approaches available to those skilled in the art, including but not limited to 
such methodologies or approaches as anti-sense (e.g., Gray et al. (1992), Plant Mol. 

20 BioL 19:69-87), ribozymes (e.g., Wegener et al (1994) Mol. Gen. Genet 1994 Nov 
15;245(4):465-470), co-suppression (e.g. Fray et al. (1993) Plant Mol. Biol. 
22:589-602), targeted disruption of the gene (e.g., Schaefer et al. Plant J. 
1 1 :1 195-1206, 1997), intracellular antibodies (e.g., see Rondon et al. (1997) Annu. Rev. 
Microbiol. 51:257-283) or whatever other approaches rely on the knowledge or 

25 availability of the nucleic acid sequences of the invention, or the enzymes encoded 
thereby. 

Host systems according to the present invention preferably comprise any 
organism which is capable of producing carotenoids, or which already produces 
carotenoids. Such organisms include plants, algae, certain bacteria, cyanobacteria and 
30 other photosynthetic bacteria. Transformation of these hosts with vectors according to 
the present invention can be done using standard techniques. See, for example, 
Sambrook et aL, Molecular Cloning A Laboratory Manual . Cold Spring Harbor 
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Laboratory, Cold Spring Harbor, NY, 1989; Ausubel et al., Current Protocols in 

Molecular Biology , Greene Publishing and Wiley Interscience, New York, 1991. 

Alternatively, transgenic organisms can be constructed which include the nucleic 

acid sequences of the present invention. The incorporation of these sequences can 
5 allow the controlling of carotenoid biosynthesis, content, or composition in the host cell. 

These transgenic systems can be constructed to incorporate sequences which allow 

for the overexpression of the various nucleic acid sequences of the present invention. 

Transgenic systems can also be constructed which allow for the underexpression of the 

various nucleic acid sequences of the present invention. Such systems may contain 
10 anti-sense expression of the nucleic acid sequences of the present invention. Such 

anti-sense expression would result in the accumulation of the substrates of the enzyme 

encoded by the sense strand. 

Having generally described this invention, a further understanding can be 

obtained by reference to certain specific examples which are provided herein for 
15 purposes of illustration only and are not intended to be limiting unless otherwise 

specified. 

EXAMPLE 1 

Isolation of plant cPNAs that convert b-carotene into compounds w ith k etocarotenoid- 
like spectra 

20 A flower cDNA library from the plant Adonis aestivalis was introduced into a 

strain of Escherichia coli engineered to accumulate the yellow carotenoid pigment fl- 
ea rotene (see Cunningham et al M Plant Cell 8:1613-26, 1996). This strain of £. coli 
normally forms yellow colonies when cultures are spread on a solid agar growth 
medium. Ketocarotenoids that are derived from b-carotene, such as echinenone and 

25 canthaxanthin (Figure 1 ), are, in contrast, orange to orange-red in color. Colonies that 
were orange rather than yellow in color were visually selected, and the DNA sequences 
of the Adonis aestivalis cONAs within the plasmid vectors contained in these colonies 
were ascertained. Two distinct cDNAs were obtained from analysis of cDNA inserts in 
plasmids obtained from approximately 10 selected colonies. The DNA sequences of 

30 these two ketolase cDNAs are presented herein. 

The products produced by the ketolases of the invention which have been 
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expressed in a P-carotene-accumulating strain of Eschericia coli have not yet been 
identified. As many as 5 or 6 different colored bands, in addition to the substrate P- 
carotene, may readily be discerned by C 18 TLC separation (see Figure 3). To provide 
appropriate standards to assist in identification, an H. pluvialis ketolase and an 
5 Arabidopsis (3-carotene hydroxylase were separately introduced into the p-carotene- 
accumulating E. coli to produce echinenone (3-keto-P,p-carotene) and canthaxanthin 
(3,3-diketo-p,P-carotene) or p-cryptoxanthin (4-hydroxy-P, p-carotene) and zeaxanthin 
(4,4'-dihydroxy-p,P-carotene). None of the compounds formed in the presence of the 
ketolases of the invention (no difference was observed in products formed in the 

10 presence of the two different nucleic acid sequences of the invention) both migrate in 
the TLC system and have the absorption spectrum expected for echinenone, 
canthaxanthin, P-cryptoxanthin, or zeaxanthin. Two of the colored TLC bands produced 
in the presence of the Adonis ketolase cDNAs are orange in color. Orange band #1 
has an absorption spectrum similar to that of canthaxanthin (see Figure 4) but migrates 

15 in a position that indicates a polarity intermediate to echinenone and p-carotene. 
Orange band #2 also has an absorption spectrum like that of canthaxanthin but 
migrates in a position that indicates a polarity intermediate to canthaxanthin and 
zeaxanthin (see Figure 3). The absorption spectra and TLC results suggest that the 
two orange products could be desaturated at the 3-4 positions of both rings (3,4,- 

20 didehydro; see Figure 2). Orange band #1 (see Figure 3) might then be 3 f 4£'A'- 
tetradehydro-P,p-carotene. To substantially affect the absorption spectrum of the 
substrate P-carotene, any modifications very likely involve a carbon that lies in 
conjugation with the conjugated chain of carbon-carbon double bonds that constitute 
the chromophore (Goodwin, 1980; The Bioch emistr y of the Carotenoids , volume I; 2 nd 

25 edition, Chapman and Hall). For the spectra obtained, only the carbons at the number 
4 position of the two rings appear to be plausible locations for modification. The 
multitude and TLC migrations of the yellow and orange products produced from the 
symmetrical P-carotene, however, also indicates that the enzymes of the invention carry 
out more than a single type of reaction. The apparent homology of the ketolases of the 

30 invention to the Arabidopsis P-carotene hydroxylase would suggest that compounds 
with a hydroxyl at the 3 and/or 4 positions of one or both rings are another possible 
outcome (see Figure 2). In fact, such compounds have been identified in Adonis (see 
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above), and it has long been conjectured that a hydroxyl at position 4 is an intermediate 
in the formation of the 4-keto (e.g. crustaxanthin, a 3,3\4,4' tetrahydroxy carotenoid that 
might be a precursor for astaxanthin in the exoskeleton of the lobster). The histidine 
motifs and secondary structure in common to the hydroxylase and ketolase enzymes 
5 are characteristics of a large group of di-iron oxygenases whose members also include 
examples of desaturases (J. Shanklin, 1998, Ann. Rev. Plant Physiol. Plant Mol. BioL), 
therefore a 3-4 desaturation (and/or perhaps a 2-3 desaturation in one or more of the 
yellow compounds) would also seem a plausible outcome. 

To summarize the results of this example for the Adonis ketolases of the 

10 invention, a number of different carotenoids, including two with ketocarotenoid-like 
spectra, are produced from (3-carotene via the action of the products of either of the two 
different nucleic acids of the invention. These orange compounds appear to be the 
major products. Truncation and fusion of the cDNAs to a stronger promoter in the 
vector pTrcHis (Invitrogen) was detrimental to growth of E. coli but did result in 

15 improved yield of the most polar orange product (orange band #2 in Figure 3). 
Introduction of a cyanobacterial ferredoxin did not change the yield or relative amounts 
of the various products. Without being bound by theory, it may be that the 
ketocarotenoids produced in flower petals of Adonis actually include the as yet 
unidentified orange compounds that are produced in E. coli using the nucleic acids of 

20 the invention. 

EXAMPLE 2 
Substrate specificity of the Adonis ketolases 

Carotenoids with e rings are common in plants. The e ring differs from the b ring 
only in the position of the double bond within the ring (Figure 2). The e ring is reported 

25 to be a poor substrate for the Arabidopsis b-carotene hydroxylase (Sun et al., 1996). 
The Adonis ketolase cDNAs were introduced into strains of E. coli engineered 
(Cunningham et al., 1996) to accumulate carotenoids with one or two e rings (d- 
carotene and e-carotene), or the acyclic carotenoid lycopene. TLC analysis of acetone 
extracts revealed that these carotenoids were not modified by the Adonis ketolases. 

30 as indicated by a lack of any new products formed. Products produced in £. coli 
engineered to accumulate zeaxanthin (Sun et al., 1996) appeared to be the same as 
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for p-carotene accumulating cultures indicating that a 3-OH is likely to be one of the 
functional groups introduced to the b ring by the Adonis ketolases. The more polar 
orange band produced from b-carotene through the action of the Adonis ketolases (e.g., 
orange band 2 in Figure 3), therefore, could very well be 3,3 , -dihydroxy-3 ) 4,3 , 1 4 , - 
5 tetradehydro-b,b-carotene. 

The references cited in the application, along with the following references, are 
incorporated by reference: 

Bouvier F, et ah (1998) Xanthophyll biosynthesis: molecular and functional 
characterization of carotenoid hydroxylases from pepper fruits (Capsicum annuum L). 
10 Biochim Biophys Acta. 1391:320-8 

" Breitenbach J, et al. (1996) Expression in Escherichia coli and properties of the 
carotene ketolase from Haematococcus pluvialis. FEMS Microbiol Lett. 140:241-6 

Cunningham FX Jr, Gantt E (1998) Genes and enzymes of carotenoid biosynthesis in 
plants. Ann Rev Plant Physiol Plant Mol Biol 49: 557-583 

1 5 Fernandez-Gonzalez B, et al. (1 997) A new type of asymmetrically acting beta-carotene 
ketolase is required for the synthesis of echinenone in the cyanobacterium 
Synechocystis sp. PCC 6803. J Biol Chem. 272:9728-33 

Fraser PD, et al. (1997) In vitro characterization of astaxanthin biosynthetic enzymes. 
J Biol Chem. 1997272:6128-35 

20 Fraser PD f et al. (1998) Enzymic confirmation of reactions involved in routes to 
astaxanthin formation, elucidated using a direct substrate in vitro assay. Eur J Biochem. 
252:229-36 

Harker M, et al. (1997) Biosynthesis of ketocarotenoids in transgenic cyanobacteria 
expressing the algal gene for beta-C-4-oxygenase, crtO. FEBS Lett. 404:129-34 
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Kajiwara S, et al. (1995) Isolation and functional identification of a novel cDNA for 
astaxanthin biosynthesis from Haematococcus pluvialis, and astaxanthin synthesis in 
Escherichia coli. Plant Mol Biol. 29:343-52 

Lotan T, et al. (1995) Cloning and expression in Escherichia coli of the gene encoding 
beta-C-4-oxygenase, that converts beta-carotene to the ketocarotenoid canthaxanthin 
in Haematococcus pluvialis. FEBS Lett. 364:125-8 

Misawa N, et al. (1995) Canthaxanthin biosynthesis by the conversion of methylene to 
keto groups in a hydrocarbon beta-carotene by a single gene. Biochem Biophys Res 
Commun.209:867-76 

Misawa N, et al. (1995) Structure and functional analysis of a marine bacterial 
carotenoid biosynthesis gene cluster and astaxanthin biosynthetic pathway proposed 
at the gene level. J Bacteriol. 177:6575-84 

Miura Y, et al. (1998) Production of the carotenoids lycopene, beta-carotene, and 
astaxanthin in the food yeast Candida utilis. Appl Environ Microbiol. 64:1226-9 

Shanklin J, et al. (1997) Mossbauer studies of alkane omega-hydroxylase: evidence for 
a diiron cluster in an integral-membrane enzyme. Proc Natl Acad Sci USA. 94:2981-6 

Shanklin J, Cahoon EB (1998) Desaturation and related modifications of fatty acids. 
Ann Rev Plant Physiol Plant Mol Biol 49: 61 1-641 

Wang CW, et al.Engineered isoprenoid pathway enhances astaxanthin production in 
Escherichia coli. Biotechnol Bioeng. 1999 Jan 20;62(2):235-41. 
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I claim: 

1. A method of producing a ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and has the 
5 nucleic acid sequence of SEQ ID NO: 1 or 3, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 
the ketocarotenoid. 

2. The method of claim 1, wherein the host cell is selected from the group 
10 consisting of a bacterial cell, an algal cell and a plant cell. 

3. A method of producing a ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and has a 
sequence which encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the 
15 heterologous nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 
the ketocarotenoid. 

4. The method of claim 3, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

20 5. A method of modifying the production of carotenoids in a host .cell, relative to an 
untransformed host cell, the method comprising 

inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
ketolase enzyme activity and has the nucleic acid sequence of SEQ ID NO: 1 or 3, 
25 wherein the heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence in the host cell to 
modify the production of the carotenoids in the host cell, relative to an untransformed 
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host cell. 

6. The method of claim 5, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

7. A method of modifying the production of carotenoids in a host cell, relative to an 
5 untransformed host cell, the method comprising 

inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
ketolase enzyme activity and has a sequence which encodes the amino acid sequence 
of SEQ ID NO: 2 or 4, wherein the heterologous nucleic acid sequence is operably 
10 linked to a promoter; and 

expressing the heterologous nucleic acid sequence in the host cell to 
modify the production of the carotenoids in the host cell, relative to an untransformed 
host cell. 

8. The method of claim 7, wherein the host cell is selected from the group 
15 consisting of a bacterial cell, an algal cell and a plant cell. 

9. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has the nucleic acid sequence of SEQ ID NO: 1 . 

10. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has the nucleic acid sequence of SEQ ID NO: 3. 

20 11. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has a sequence which encodes the amino acid sequence of SEQ 
ID NO: 2. 

12. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has a sequence which encodes the amino acid sequence of SEQ 
25 ID NO: 4. 
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13. A vector which comprises the nucleic acid sequence of any one of claims 9-12, 
wherein the nucleic acid sequence is operably linked to a promoter. 

14. A host cell which is transformed with the vector of claim 13. 

15. The host cell of claim 14, wherein the host cell is selected from the group 
5 consisting of a bacterial cell, an algal cell and a plant cell. 

16. The host cell of claim 14, wherein the host cell is a photosynthetic cell. 

17. The host cell of claim 14, wherein the host cell contains a ketocarotenoid. 

18. The host cell of claim 14, wherein the host cell contains modified levels of 
carotenoids, relative to an untransformed host cell. 

10 19. A purified ketolase enzyme which is encoded by the amino acid sequence of 
SEQ ID NO: 2. 

20. A purified ketolase enzyme which is encoded by the amino acid sequence of 
SEQ ID NO: 4. 
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Figure 5 [SEQ ID NO: 5] 
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ggg 


ctgcaggaat 


tegg-cacgag 


1 


agcaaicrra 


gcgttcagta 


caagttattc 


tttccacaag 


aatctcttgt 


51 


tgcacrcaaa 


acaagacatt 


ctcaaccgcc 


catgttt-gct 


cttctctcca 


101 


gt £ grace 33 


agtcgectat 


gagaaagaaa 


aagacacatc 


gtgctgcatg 


151 


tatctgetet 


gtegcagaga 


gaacaaggaa 


ccttgatatt 


cctcaaatcg 


201 


aagaagagga 


agagaacgag 


gaagaactaa 


tagaacagac 


-ggattctggc 


251 




taaagaaaac 


gctagggggg 


aaacaatcaa 


gacggtccac 


301 


zggczecazz 


gtcgcacccg 


tatcttgect 


tgggatcctt 


tcaatgatcg 


351 




rcacttcaag 


ttttcaegge 


taatggagcg 


tgga-gatatt 


401 


cctgtcgcag 


aaatggggat 


tacgtttgcc 


gcctttgttg 


ctgetgegat 


451 


tggcacggaa 


tttttgtcag 


gatgggttca 


caaagaactc 


tggcacgatt 


501 


etttgeggra 


cattcacaag 


tctcaccata 


ggtcacgaaa 


aggccgcttc 


551 






tattattaac 


gcgcttcctg 


etattgetet 


601 




ggattctcaa 


atgaaggect 


ccttcctgga 


gectgetttg 


651 


g tacegg* zz 


rggaacgaca 


gtctgtggca 


tggcttacat 


ttttcttcac 


701 


aatggccrrr 


cacaccgaag 


gttcccagta 


gggcttattg 


caaacgtccc 


751 




aagctggctg 


cagctcacca 


aatccatcac 


tcaggaaaat 


801 


ttcagggcgt 


accatttggc 


ctgt-tccttg 


gaccccagga 


attggaagaa 


851 


gtaagaggag 


gcactgaaga 


attggagagg 


gtgatcagtc 


gtacagctaa 


901 


aegaaegcaa 


tcatctacat 


gaatcaactc 


ttttacattt 


at-ga-ggtttt 


951 


agtttatcgg 


tgttacaagt 


cacacatttg 


tgtcgttgta 


gta-attcaaa 


1001 


gttaccarac 


tcttttttag 


aatttttttt 


tgatgtatag 


gtogcggagt 


1051 


tacggttaca 


aaggccaaat 


ctattgttgt 


ggaattccat 


tattaaaaat 


1101 


aaaaatiaga 


-gtttgtagtt 


ttatctggtg 


atcaatatca 


atatatatta 


1151 


at taaagcaa 


aaaaaaaaaa 


aaaaaa ctcgag 
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Figure 6 [SEQ ID NO: 6] 

MGLQEFOTR 

aisvfstsys fhknlllhsk qdilnrpcll fspwvespm rkkkthraac 
icsvaertrn Idipqieeee eneeelieqt dsgiihikkt lggkqsrrst 
gsivapvscl gilsmigpav yfkfsrlmec gdipvaemgi tfaafvaaai 
gteflsgwvh kelwhdslwy ihkshhrsrk grfefndvfa iinalpaial 
inygfsnegl Ipgacfgtgi gttvcgmayi flhnglshrr fpvglianvp 
yfhklaaahq ihhsgkfqgv pfglflgpqe leevrggtee lervisrtak 
rtqsst* 
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ggg 


ctgcaggaat 


tcggcacgag 


1 


agcaatttca 


gtgttcagtt 


caggrtattc 


tttctacaag 


aatctcttgt 


51 


tggactcaaa 


accaaatatt 


ctcaaacccc 


catgcctgct 


attctctcca 


101 


gttgtgatca 


rgtcgcctat 


gagaaagaaa 


aagaaacatg 


gtgatccatg 


151 


tatctgctcc 


gttgcaggga 


gaacaaggaa 


ccttgatatt 


cctcaaattg 


201 


aagaagagga 


agagaatgtg 


gaagaactaa 


tagaacagac 


cgattctgac 


251 


atagtgcata 


taaagaaaac 


actagggggg 


aaacaatcaa 


aacggcccac 


301 


tggctccat z 


gtcgcacccg 


tarcr-gtct 


tgggatcctt 


tcaatgactg 


351 


gacctgCwg: 


rtacttcaag 


rct^cacggc 


taatggaggg 


tggagai ata 


401 


cctgtagcag 


aaatggggat 


tacgrttgcc 


accttitgttg 


ctgctccrgt 


451 


tggcacggag 


tttttgtcag 


catgggttca 


caaagaactc 


tggcacgagt 


501 


ctttgtggta 


cattcacaag 


tctcaccatc 


ggtca-cgaaa 


aggccgcntc 


551 


gagtrcaatg 


atgtgtttgc 


tatrartaac 


gcgcttcccg 


ctattgctct 


601 


tatcaattat 


ggattctcca 


atgaaggcct 


ccttcctgga 


gcgtgcrttg 


651 


gtgtcggtct 


cggaacaaca 


gtCugtggta 


tggcttacat 


ttttctrcac 


701 


aatggccrat 


cacaccgaag 




tggcttattg 


cgaacgt: ccc 


751 


ttatttccac 


aagctggctg 


cagcrcacca 


aatacaccac 


tcaggaaaat 


801 


ttcagggtgt 


accatttggc 


ctgtcccttg 


gacccaagga 


attggaagaa 


851 


gtaagaggag 


gcactgaaga 


gttggagagg 


gtaatcagtc 


gtacaactaa 


901 


acgaacgcaa 


ccatctacct 


gaarcaattt 


ttttacatat 


ataaggrttt 


951 


agtttatcgg 


tgttataaaa 


■tcacacatcc 


gtatcgtttt 


agtaagccaa 


1001 


agttaagata 


cttccttctt 


agaatatttt 


ttgatgtata 


ggtcgcggat 


1051 


atactgttac 


actattcgtt 


gtggaattcc 


attataaaaa 


aaaaaaaaaa 


1101 


aaaaaaaaaa 


aa ctcgag 
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Figure 8 [SEQ ID NO: 8] 

MGLQEPGTR 

aisvfssgys fyknllldsk pnilkppcll f spwimspm rkkkkhgdpc 
icsvagrtrn laipqieeee enveelieqt dsdivhikkt Iggkqskrpt 
gsivapvscl gilsmigpav yfkfsrlmeg gdipvaemgi tfatfvaaav 
gteflsawvh kelwhsslwy ihkshhrsrk grfefndvfa iinalpaial 
inygfsnegl Ipgacfgvgl gttvcgmayi fihnglshrr fpvwlianvp 
yfhklaaahq ihhsgkfqgv pfglflgpke leevrggtee lervisrttk 
rtqpst* 
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Figure 9: Gap of SEQ ID NO: 9 and SEQ ID NO: 3 
1 agcaatctcagtgttcagtacaagttattctttccacaagaatctcttgt 50 

iniii iiiiiiiiiin ii iiiiiiiini iiiiiiiiiiiiiii 

1 agcaatttcagtgttcagttcaggttattctttctacaagaatctcttgt 50 
51 tgcactcaaaacaagacattctcaaccgcccatgtttgctcttctctcca 100 

II MINIMI I I MINIMI MM II I I I I I I II Mill 

51 tggactcaaaaccaaatattctcaaacccccatgcctgctattctctcca 100 
101 gttgtggzggagtcgcctatgagaaagaaaaagacacatcgtgctgcatg 150 

MINI I MIIIMIIIIIIIIMIIIIII I I II III I MM 

101 gttgtgatcargtcgcctatgagaaagaaaaagaaacatggt-gatccatg 150 
151 tatctgctcrgttgcagagagaacaaggaaccttgatattcctcaaattg 2 00 

II I II I Mi III II II 1 1 1 1 1 1 1 1 1 II 1.1 1 1 1 1 IIIIIIIIIIIIIII 

151 tatctgcrccgttgcagggagaacaaggaaccttgatattcctcaaattg 200 
201 aagaagagcaagagaacgaggaagaactaatagaacagacggattct-ggc 2 50 

MIMIMMIIMM I IIIMiMIMIIMMMII lllllll I 

201 aagaagaggaagagaatgtggaagaactaatagaacagaocgattctgac 250 
251 at aatt caz a taaagaaaacgctaggggggaaacaatcaagacggtocac 300 

Ml I MMMIMIMM MIMMMMIMMIM MM MM 

251 atagtgcarataaagaaaacactaggggggaaacaatcaaaacggcccac 3 00 
301 tggctccaztgtegcacccgtatcttgtcttgggatcctttcaatgatcg 350 

M I II 1 1 1 ! ! II M M 1 1 1 M 1 1 M I II M M M M I II II I II M II I 

301 tggctccat-gtcgcacccgtatcttgtcttgggatcctttcaat-gattg 350 
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Figure 9 (cont.) 

351 gacctgctgtttacttcaagttttcacggctaatggagtgtggagatatt 4 00 

MMMMMMMMMMMMMMMMMMM MM HUM 

351 gacctgctgtttacttcaagttttcacggctaatggagggtggagatata 4 00 
401 cctgtcgcagaaatggggattacgtttgccgcctttgttgctgctgcgat 450 

M M I M I M M ! I M I M M M M M M M M M M M M M M I 

4 01 cctgtagcagaaatggggattacgtttgccacctttgttgctgctgctgt 4 50 
451 cggcacggaarttttgucaggatgggtzcacaaagaaccctggcacgatt. 500 

MMMM! MMMMM M I M M M M M M M M M M M M I 

451 tggcacggagtttttgtcagcatgggttcacaaagaactctggcacgagt 500 
501 crttgtggtacattcacaagtctcaccataggtcacgaaaaggccgcttc 550 

MIMMMMMMMMMMMMM MMMMMMMIMMI 

501 ctttgrggtacattcacaagtctcaccatcggtcacgaaaaggccgcttc 55 0 
551 gagttcaatgatgtgtttgctattartaacgcgcttcctgctattgctct 600 

MMMMMMMMMMMMMMMMMMM MMMMMI 

551 cagttcaatgatgtgtttgctattattaacgcgcttcccgctattgctct 600 
601 tatcaattatggattctcaaatgaaggccrccttcctggagcctgctttg 650 

MMIMMMMMMI MMMMMMMMMMMI MMMI 

601 tatcaattatggattctccaatgaaggcctccttcctggagcgtgctttg 650 
651 gtaccggtcttggaacgacagtctgtggcatggcttacatttttcttcac 700 

1 1 1MMMMMI MMMMMI MMMMMMMMMMI 

651 gtgtcggtcttggaacaacagtctgtggtatggcttacatttttcttcac 700 
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Figure 9 (cont.) 

701 aatggcctttcacaccgaaggttcccagtagggcttattgcaaacgtcoc 750 

MIMIII IIIIIIIMIIIIIIMIIM MINIUM MINIM 

701 aatggcctatcacaccgaaggttcccagtatggcttattgcgaacgt-ccc 750 
751 ttatttccacaagctggctgcagctcaccaaatccatcactcaggaaaat 80O 

i ii ii ii ii i ii 1 1 1 ii 1 1 ii ii ii ii i ii ii i ii i ii ii ii ii i ii i 

751 ttatttccacaagctggcrgcagctcaccaaatacaccactcaggaaaat 8 00 
8 01 tccagggtgraccatttggcctgtr ccttggaccccaggaat cggaagaa £50 

I i I ! 1 1 1 1 1 1 f 1 1 ! ! I ! 1 1 1 1 1 1 1 { ! 1 1 1 1 1 1 1 E 1 1 I I ! f I i J f 1 1 1 1 1 

801 ttcagggtgtaccatttggcctgttccttggacccaaggaattggaagaa 850 
851 gtaagaggaggcactgaagaattggagagggtgatcagfccgtacagetaa 900 

I I II II I II I I II II II I II I 1 1 1 1 1 II I! I II II I I I II I I I II I I 

851 gtaagaggaggcactgaagagttgcagagggtaatcagtcgtacaactaa 900 
901 acgaacgcaatcatctacaTGAatcaactcttttacatttatgaggtttt 950 

I I II II 1 1 1 1 II II II I I II 1 1 1 1 1 I N I II I II III II II I II 

901 acgaacgcaaccatctaccTGAatcaatttttttacatatataaggtttt 950 

* • • . • 

951 agtttatcggtgtta . caagtcacacatttgtgtcgttgta'gtaatt-caa 999 

I M 1 1 f 1 M M I M I II INIINI II INN NNN INI 

951 agtttatcggtgttataaaatcacacatccgtat-cgttttagtaagtcaa 1000 

1000 agttaccatactcttttttagaattttttttt-gatgtataggtcgcggag 1049 

INN NNI II I II NININNINNNNNN 

1001 agttaagatacttccttcttagaatattttttgatgtataggtcgcggat 1<OSO 
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Figure 9 (cont.) 

* > « • * 

1050 ttacggttacaaaggccaaatctattgttgtggaattccattattaaaaa 1099 

III Mill I I llllllllllllllllll Mill 

1051 atactgttrac actattcgttgtggaattccattataaaaaa 1091 

1100 taaaaattagagtrcgtagttttarctggcgatcaatatcaatatatatt 1149 

MM I I 

1092 ataaaaaaaaaaaaaaaaaaa 



SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9961 652A1_I_> 



WO 99/61652 



PCT/US99/1045S 



13/16 

Figure 10: Gap of SEQ ID NO: 2 and SEQ ID NO: 4 

. * • • 

1 A I S VF S T £ Y S FHKNLLLH S KQD I LNRP C LL F S PWVES PMRKKKTHRAAC 50 

MINI. Mhllill II .11 I I I I M I I I • I I I I I I I I I 
1 AISVFSSGYSFYKNLLLDSKPNILKPPCLLFSPWIMSPMRKKKKHGDPC 50 

• • • 

51 I CS VAERTRMLD I ?Q I EEEEENEEEL I EQTDSG 1 1 H I KKTLGGKQS RRS T 100 

INN MMIIiUMIMM IIIMIIII hllllllllllhl I 

51 I CS VAGRTRNLD I ?Q I EEEEENVEEL I EQTDSD I VH I KKTLGGKQS KRPT 100 
. • • • * 

101 G £ I VAP VS ZLGILSMIG ? AVY FKF S E.LME C GD I ? VAEMG I T F AA F VAAA I 15 0 

1 1 1 1 1 1 1 : : 1 1 ii M 1 1 M 1 1 1 1 1 1 1 1 ! ! ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • 

101 GSIVAPVSCLGILSMIGPAVYFKFSRLMEGGDIPVAEMGITFATFVAAAV 150 
151 GTEFLSG^-IKELWHDSLWYIHKSHHRSRKGRFEFNDVFAIINALPAIAL 200 

llllll MlllllhllMIIMIIIMIIIIIMIIMIIIIIIIIII 

151 GTEFLSAW\"-iKELWKESLWYIHKSKHRSRKGRFEFNDVFAIINALPAIAL 200 
2 01 I NYGF SNZ3LL P GACFGTGLGTTVCGMA Y I FLHNGL S HRRF P VGL I ANVP 250 

I I 1 1 1 I ! : M I I I 1 1 I I I I I I I 1 1 1 I I I I ! I I I I I I I I I I I I llllll 

201 INYGFSNEGLLPGACFGVGLGTTVCGMAYIFLHNGLSHRRFPVWLIANVP 250 
251 YFKKIAAAHQIKHSGKFQGVPFGLFLGPQELEEVRGGTEELERVISRTAK 300 

MIIHMMimilllllllimil-miMIIIIMIIMIII I 

251 YFHKLAAAHQIHHSGKFQGVPFGLFLGPKELEEVRGGTEELERVISRTTK 300 



301 RTQSST* 307 

III III 
301 RTQPST* 3 07 



SUBSTITUTE SHEET (flUL€ 26) 

BNSDOCID: <WO 9961652A1_I_> 



PCT/US99/104S5 

14/16 

Figure 1 1 : Gap of SEQ ID NO: 2 and Arabidopsis {3-carotene hydroxylase (SEQ 
ID NO: 10) 

1 AISV?£TSYSFHKNLLLHSKQDII2^ 50 

- II- -I II - J | • : | • 

1 MAAXLSTAVTFK? . . . LHRSFSSSSTDFRLRLPKSLSGFSPSLRFKRFSV 47 

52 ICSVAERTRNLDIPQIEEEEENEEELIEQTDSGI IHIKKTLGGKQSRRST 100 

I M -I I II : : : . | |.| Ml 

4 8 C YWEERRQNS P I ENDERPE S TS £ TNAI DAE YLALRLAEKLERKKS ERS T 97 

101 G S I VA? VS CLG I L S M I G P A VYF KF £ RLME CGD I P VAEMG I T FAAF VAAA I 150 

I 'I -I II II I I I : = I I I I I : I • I I III I II- 
98 YLIAAMLSSFGITSMAVMAVYYRFSWQMEGGEISMLEMFGTFAIjSVGAAV 147 

151 GTEFLSGWVHKELWHDSLWYIHKSEERSRKGRFEFNDVFAIINALPAIAL 200 

I II • ! h III III Mill: I- 1 II I I lllhii Ml I 

14 8 GMEFWARWAHRALWHAS LWNMHE SHKKPRE G P FELNDVF A I VNAGP AI GL 197 
201 INYGFSNEGLLPGACFGTGLGTTVCGMAYIFLHNGLSHRRFPVGLIANVP 250 

-III l-ll-ll Ml III II MM. Ml Mill I I Ml 

198 LSYGFFNKGLVPGLCFGAGLGITVFGIAYMFVHDGLVHKRFPVGPIADVP 247 

• • * * • 

251 YFHKIAAAHQIHHSGKFQGVPFGLFLGPQELEEVRGGTEELERV1SRTAK 3 00 

I MMMIMMIhlllllMIIIIIMII:MII I 

24 8 YLRKVAAAHQLHHTDKFNGVPYGLFLGPKELEEV ♦ GGNEELDKEI SRR1K 2 96 

3 01 RTQSST* 3 07 

297 SYKKASGSGSSSSS* 311 
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Figure 12A (SEQ ID NO: 11) 

I CATACCATAA A7AGTAGAGG ACAACCTACA AACCAACCAC CAGAAACCTC 50 
51 CAATGGCAGC 



Figure 12B (SEQ ID NO: 12) 

MAAAI3VFSSGYS7YKNLLLDSKPNILK?? 

TIl^rLDIPQIEEEZZNVZZLIZQTDSDIvrllKK— GGXQSXR?TGSIVA?VSCLGI1SMIG 
P AVYF KF S RL.MEGGD I ? VAEMG I TFAT™ /AAA VGTEFL SAWVEKZLWHESLWYIHKSHHR 
SRKG?.r EFNDVFAZ INALPAIALINYGFSNEGLZPGACFGVGLGTTVCGMAYI FLHNGLS 
ERRFPVW LI ANVP YFKXLAAAEQ IKHSGKFQGVPFGLFLGPKZLEEVRGGTEZLERVISR 
TTKRTQPST* 
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Figure 13 



Atl 

At2 

Cal 

Ca2 

AdRl 

AdX6 



* 20 * 40 * 60 

MAAXLSTAVTPKPLHRSPSSSSTDFRLRLPKsXsgf spslR fi 

MAAGLSTIAVTLKPLNRS SFSANHPI 9 t avf ppslRFNGFRR rki 

MAAEISISASSRAICLQRNPFPAPKYFATAPpllffspltCOTiDAXT^ 

TTGRYHyQLVWCQISPSSTSRTSYYRHSPPLGPKPTPTTPSVTpitpfflpalGSIIiRCRR rps 

AXSVFSTSSFHKltLI&SKQDll^pIllfspvvfe 



svcR 


we 




52 


tvclj 


wa 




53 


aacP 


vlk 




62 


tvclj 


vie 


: 


71 


licl 


vae 




56 




vag 




59 



PsP 



C V 



AtX 

At2 

Cal 

Ca2 

AdKl 

AdK6 



80 * 100 

errqNSPIENDERPESTSSTNAIDAEYLAL 

erkqSSPMDDDNKPESTTSSSEIIJlTS 

ddklTTAQSGKQSDTEAXGDEXEVETNEEKSIJW 
ddkf KTQFEAGEEDXEMKIBBQXSAT 



bee: 



EQTDSGIX-- 
lEQTDSDIV- - 




A S GX SM aVY Ps 



Atl 

At2 

Cal 

Ca2 

AdKl 

AdK6 




MegG 



180 
vg-aavgffief] 
|s VgaawgSe £| 
,fg-aaiggeyj 
_ vg-aavgHefj 
t £ aaiva - aaigfle f : 
|tf atjva- aavg jef 
TPA v Aa O B£ 



200 





w s 




lwhasl 
lwhdalwnM 
lwhaslwh>j 
lwhaslw] 
.whdslwyghSs 
.wh*alwygh£si 
LWH SXiW H 



* 220 

f elzzdvf aivnagpai : 195 

af elndvf aitnavpai : 194 

f elndif aiinavpai : 209 

f elndvf aiinavpai : 210 

f ©fndvfaiinalpai : 198 

f efndvf aiinalpai : 201 
FE HDvPAX NA PAX 



Atl 

At2 

Cal 

Ca2 

AdKl 

AdK6 



gllsygf *x 
gllyygflx 
affsfgfr 
alldygfft 
alinygfsi 
alinygfsi 
1 yGF 



240 

grlvpgflcfgggl: 
jlvpgj cfg&li 
;lipg | cf gggl 
jlipgj cf gggl< 
?llpgac£gtgl 
jllpgjcfgvglg; 
GL PG CFG GLG 



260 



280 




i pvgg iadvpylr! 
£p v 9g ianvpylrl 
i* f P v 9§ iakvpyf q: 
r f pvgB vanvpylr; 
rfpvgSianvpyfi 
[rfpvwjiaavpyf: 
RFPVg iA VPY k 




300 * 320 

Atl : glflgpkeleavgg-n*el< 
At2 : glf lgpkqevaevgGkeal 
Cal : glflgpkeleevgv-ieale 
Ca2 : glf lgpkeleevgg-leel 
AdKl : glflgpqele«v3gGteeleS±s; 

AdK6 : glflgpkala«vJgGteel*3 isi 
GLFLGFkaleEv g EELe 



liksykkaSGSGSSSSS 




310 






305 


ikslkrl 




315 






316 






306 






309 
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SEQUENCE LISTING 

<110> CUNNINGHAM, Francis X. 

<120> CAROTENOID KETOLASE GENES AND GENE PRODUCTS, PRODUCTION 
OF KETOCAROTENOIDS AND METHODS OF MODIFYING CAROTENOIDS 
USING THE GENES 

<130> 8172-9022 

<140> Unknown 
<141> 1999-05-21 

<150> 60/086,460 
<151> 1998-05-22 

<160> 18 

<170> Patentln Ver . 2.0 

<210> 1 
<211> 1176 
<212> DNA 

<213> Adonis aestivalis 
<400> 1 

agcaatctca gtgttcagta caagttattc tttccacaag aatctcttgt tgcactcaaa 60 

acaagacatt ctcaaccgcc catgtttgct cttctctcca gttgtggtgg agtcgcctat 120 

gagaaagaaa aagacacatc gtgctgcatg tatctgctct gttgcagaga gaacaaggaa 180 

ccttgatatt cctcaaattg aagaagagga agagaacgag gaagaactaa tagaacagac 240 

ggattctggc ataattcata taaagaaaac gctagggggg aaacaatcaa gacggtccac 300 

tggctccatt gtcgcacccg. tatcttgtct tgggatcctt tcaatgatcg gacctgctgt 360 

ttacttcaag ttttcacgce taatggagtg tggagatatt cctgtcgcag aaatggggat 420 

tacgtttgcc gcctttgttg ctgctgcgat tggcacggaa tttttgtcag gatgggttca 480 

caaagaactc tggcacgatt ctttgtggta cattcacaag tctcaccata ggtcacgaaa 540 

aggccgcttc gagttcaarg atgtgtttgc tattattaac gcgcttcctg ctattgctct 600 

tatcaattat ggattctcaa atgaaggcct ccttcctgga gcctgctttg gtaccggtct 660 

tggaacgaca gtctgtggca tggcttacat ttttcttcac aatggccttt cacaccgaag 720 

gttcccagta gggcttattg caaacgtccc ttatttccac aagctggctg cagctcacca 780 

aatccatcac tcaggaaaat ttcagggtgt accatttggc ctgttccttg gaccccagga 840 

attggaagaa gtaagaggag gcactgaaga attggagagg gtgatcagtc gtacagctaa 900 

acgaacgcaa tcatctacai: gaatcaactc ttttacattt atgaggtttt agtttatcgg 960 

tgttacaagt cacacatttg tgtcgttgta gtaattcaaa gttaccatac tcttttttag 1020 

aatttttttt tgatgtetag gtcgcggagt tacggttaca aaggccaaat ctattgttgt 1080 

ggaattccat tattaaaaat aaaaattaga gtttgtagtt ttatctggtg atcaatatca 1140 

atatatatta attaaagcaa aaaaaaaaaa aaaaaa 1176 

<210> 2 
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<211> 306 
<212> PRT 

<213> Adonis aestivalis 
<400> 2 

Ala lie Ser Val Phe Ser Thr Ser Tyr Ser Phe His Lys Asn Leu Leu 
15 10 15 

Leu His Ser Lys Gin Asp lie Leu Asn Arg Pro Cys Leu Leu Phe Ser 
20 25 30 

Pro Val Val Val Glu Ser Pro Met Arg Lys Lys Lys Thr His Arg Ala 
35 40 45 

Ala Cys lie Cys Ser Val Ala Glu Arg Thr Arg Asn Leu Asp lie Pro 
50 55 60 

Gin He Glu Glu Glu Glu Glu Asn Glu Glu Glu Leu He Glu Gin Thr 
65 70 75 80 

Asp Ser Gly He He His He Lys Lys Thr Leu Gly Gly Lys Gin Ser 
85 90 95 

Arg Arg Ser Thr Gly Ser He Val Ala Pro Val Ser Cys Leu Gly He 
100 105 110 

Leu Ser Met lie Gly Pro Ala Val Tyr Phe Lys Phe Ser Arg Leu Met 
115 120 125 

Glu Cys Gly Asp lie Pro Val Ala Glu Met Gly He Thr Phe Ala Ala 
130 135 140 

Phe Val Ala Ala Ala He Gly Thr Glu Phe Leu Ser Gly Trp Val His 
145 150 155 160 

Lys Glu Leu Trp His Asp Ser Leu Trp Tyr He His Lys Ser His His 
165 170 175 

Arg Ser Arg Lys Gly Arg Phe Glu Phe Asn Asp Val Phe Ala He He 
180 185 190 

Asn Ala Leu Pro Ala lie Ala Leu He Asn Tyr Gly Phe Ser Asn Glu 
195 200 205 

Gly Leu Leu Pro Gly Ala Cys Phe Gly Thr Gly Leu Gly Thr Thr Val 
210 215 220 

Cys Gly Met Ala Tyr He Phe Leu His Asn Gly Leu Ser His Arg Arg 

2 
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235 
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240 



Phe Pro Val Gly Leu 
245 

Ala Ala His Gin lie 
260 

Gly Leu Phe Leu Gly 
275 

Glu Glu Leu Glu Arc 
290 



lie Ala Asn Val Pro Tyr 
250 

His His Ser Gly Lys Phe 
265 

Pro Gin Glu. Leu Glu Glu 
280 

Val lie Ser Arg Thr Ala 
295 



Phe His Lys Leu Ala 
255 

Gin Gly Val Pro Phe 
270 

Val Arg Gly Gly Thr 
285 

Lys Arg Thr Gin Ser 
300 



Ser Thr 
305 



<210> 3 
<211> 1112 
<212> DNA 

<213> Adonis aestivalis 
<400> 3 

agcaatttca gtgttcagtt caggttattc tttctacaag aatctcttgt tggactcaaa 60 

accaaatatt ctcaaacccc catgcctgct attctctcca gttgtgatca tgtcgcctat 120 

gagaaagaaa aagaaacatg gtgatccatg tatctgctcc gttgcaggga gaacaaggaa 180 

ccttgatatt cctcaaattg aagaagagga agagaatgtg gaagaactaa tagaacagac 240 

cgattctgac atagtgcata taaagaaaac actagggggg aaacaatcaa aacggcccac 300 

tggctccatt gtcgcacccg tatcttgtct tgggatcctt tcaatgattg gacctgctgt 360 

ttacttcaag ttttcacggc taatggaggg tggagatata cctgtagcag aaatggggat 420 

tacgtttgcc acctttgttg ctgctgctgt tggcacggag tttttgtcag catgggttca 480 

caaagaactc tggcacgagt ctttgtggta cattcacaag tctcaccatc ggtcacgaaa 540 

aggccgcttc gagttcaatg atgtgtttgc tattattaac gcgcttcccg ctattgctct 600 

tatcaattat ggattctcca atgaaggcct ccttcctgga gcgtgctttg gtgtcggtct 660 

tggaacaaca gtctgtggta tggcttacat ttttcttcac aatggcctat cacaccgaag 720 

gttcccagta tggcttattg cgaacgtccc ttatttccac aagctggctg cagctcacca 780 

aatacaccac tcaggaaaat ttcagggtgt accatttggc ctgttccttg gacccaagga 840 

attggaagaa gtaagaggag gcactgaaga gttggagagg gtaatcagtc gtacaactaa 900 

acgaacgcaa ccatctacct gaatcaattt ttttacatat ataaggtttt agtttatcgg 960 

tgttataaaa tcacacatcc gtatcgtttt agtaagtcaa agttaagata cttccttctt 1020 

agaatatttt ttgatgtata ggtcgcggat atactgttac actattcgtt gtggaattcc 1080 

attataaaaa aataaaaaaa aaaaaaaaaa aa 1112 



<210> 4 
<211> 306 
<212> PRT 

<213> Adonis aestivalis 



3 
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<400> 4 

Ala lie Ser Val 
1 

Leu Asp Ser Lys 
20 

Pro Val Val He 
35 

Pro Cys He Cys 
50 

Gin He Glu Glu 
65 

Asp Ser Asp He 



Lys Arg Pro Thr 
100 

Leu Ser Met lie 
115 

Glu Gly Gly Asp 
130 

Phe Val Ala Ala 
145 

Lys Glu Leu Trp 



Arg Ser Arg Lys 
180 

Asn Ala Leu Pro 
195 

Gly Leu Leu Pro 
210 

Cys Gly Met Ala 
225 

Phe Pro Val Trp 



Phe Ser Ser Gly 
5 

Pro Asn He Leu 



Met Ser Pro Met 
40 

Ser Val Ala Gly 
55 

Glu Glu Glu Asn 
70 

Val Kis He Lys 
85 

Gly Ser He Val 



Gly Pro Ala Val 
120 

He Pro Val Ala 
135 

Ala Val Gly Thr 
150 

His Glu Ser Leu 
165 

Gly Arg Phe Glu 



Ala lie Ala Leu 
200 

Gly Ala Cys Phe 
215 

Tyr He Phe Leu 
230 

Leu He Ala Asn 



Tyr Ser Phe Tyr 
10 

Lys Pro Pro Cys 
25 

Arg Lys Lys Lys 



Arg Thr Arg Asn 
60 

Val Glu Glu Leu 
75 

Lys Thr Leu Gly 
90 

Ala Pro Val Ser 
105 

Tyr Phe Lys Phe 



Glu Met Gly He 
140 

Glu Phe Leu Ser 
155 

Trp Tyr lie His 
170 

Phe Asn Asp Val 
185 

He Asn Tyr Gly 



Gly Val Gly Leu 
220 

His Asn Gly Leu 
235 

Val Pro Tyr Phe 



Lys Asn Leu Leu 
15 

Leu Leu Phe Ser 
30 

Lys His Gly Asp 
45 

Leu Asp He Pro 



He Glu Gin Thr 
80 

Gly Lys Gin Ser 

95 

Cys Leu Gly lie 
110 

Ser Arg Leu Met 
125 

Thr Phe Ala Thr 



Ala Trp Val His 
160 

Lys Ser His His 
175 

Phe Ala He He 
190 

Phe Ser Asn Glu 
205 

Gly Thr Thr Val 



Ser His Arg Arg 
240 

His Lys Leu Ala 
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255 



Ala Ala His Gin lie His His Ser Gly Lys Phe Gin Gly Val Pro Phe 
260 265 270 

Gly Leu Phe Leu Gly Pro Lys Glu Leu Glu Glu Val Arg Gly Gly Thr 

275 280 285 



Glu Glu Leu Glu Arg Val lie Ser Arg Thr Thr Lys Arg Thr Gin Pro 



290 



295 



300 



Ser Thr 
305 



<210> 5 
<211> 1205 
<212> DNA 

<213> Adonis aestivalis 
<400> 5 

gggctgcagg aattcggcac gagagcaatc tcagtgttca gtacaagtta ttctttccac 60 

aagaatctct tgttgcactc aaaacaagac attctcaacc gcccatgttt gctcttctct 120 

ccagttgtgg tggagtcgcc tatgagaaag aaaaagacac atcgtgctgc atgtatetgc 180 

tctgttgcag agagaacaag gaaccttgat attcctcaaa ttgaagaaga ggaagagaac 240 

gaggaagaac taatagaaca gacggattct ggcataattc atataaagaa aacgctaggg 300 

gggaaacaat caagacggnc cactggctcc attgtcgcac ccgtatcttg tcttgggatc 360 

ctttcaatga tcggacctgr tgtttacttc aagttttcac ggctaatgga gtgtggagat 420 

attcctgtcg cagaaatggg gattacgttt gccgcctttg ttgctgctgc gattggcacg 480 

gaatttttgt caggatggg- tcacaaagaa ctctggcacg attctttgtg gtacattcac 540 

aagtctcacc ataggtcacg aaaaggccgc ttcgagttca atgatgtgtt tgctattatt 600 

aacgcgcttc ctgctattgc tcttatcaat tatggattct caaatgaagg cctccttcct 660 

ggagcctgct ttggtaccgg tcttggaacg acagtctgtg gcatggctta catttttctt 720 

cacaatggcc tttcacaceg aaggttccca gtagggctta ttgcaaacgt cccttatttc 780 

cacaagctgg ctgcagctc^ ccaaatccat cactcaggaa aatttcaggg tgtaccattt 840 

ggcctgttcc ttggacccca ggaattggaa gaagtaagag gaggcactga agaattggag 900 

agggtgatca gtcgtacaec taaacgaacg caatcatcta catgaatcaa ctcttttaca 960 

tttatgaggt tttagtttat cggtgttaca agtcacacat ttgtgtcgtt gtagtaattc 1O20 

aaagttacca tactcttttr tagaattttt ttttgatgta taggtcgcgg agttacggtt 1080 

acaaaggcca aatctattg- tgtggaattc cattattaaa aataaaaatt agagtttgta 1140 

gttttatctg gtgatcaazs tcaatatata ttaattaaag caaaaaaaaa aaaaaaaaac 1200 
tcgag 1205 

<210> 6 
<211> 315 
<212> PRT 

<213> Adonis aestivalis 



5 
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<400> 6 

Met Gly Leu Gin Glu Phe Gly Thr Arg Ala He Ser Val Phe Ser Thr 
15 10 15 

Ser Tyr Ser Phe His Lys Asn Leu Leu Leu His Ser Lys Gin Asp He 
20 25 30 

Leu Asn Arg Pro Cys Leu Leu Phe Ser Pro Val Val Val Glu Ser Pro 
35 40 45 

Met Arg Lys Lys Lys Thr His Arg Ala Ala Cys He Cys Ser Val Ala 
50 55 60 

Glu Arg Thr Arg Asn Leu Asp He Pro Gin He Glu Glu Glu Glu Glu 
65 70 75 80 

Asn Glu Glu Glu Leu lie Glu Gin Thr Asp Ser Gly He He His He 
85 90 95 

Lys Lys Thr Leu Gly Gly Lys Gin Ser Arg Arg Ser Thr Gly Ser He 
100 105 110 

Val Ala Pro Val Ser Cys Leu Gly lie Leu Ser Met lie Gly Pro Ala 
115 120 125 

Val Tyr Phe Lys Phe Ser Arg Leu Met Glu Cys Gly Asp He Pro Val 
130 135 140 

Ala Glu Met Gly lie Thr Phe Ala Ala Phe Val Ala Ala Ala lie Gly 
145 150 155 160 

Thr Glu Phe Leu Ser Gly Trp Val His Lys Glu Leu Trp His Asp Ser 
165 170 175 

Leu Trp Tyr lie His Lys Ser His His Arg Ser Arg Lys Gly Arg Phe 
180 185 190 

Glu Phe Asn Asp Val Phe Ala lie lie Asn Ala Leu Pro Ala He Ala 
195 200 205 

Leu lie Asn Tyr Gly Phe Ser Asn Glu Gly Leu Leu Pro Gly Ala Cys 
210 215 220 

Phe Gly Thr Gly Leu Gly Thr Thr Val Cys Gly Met Ala Tyr lie Phe 
225 230 235 240 

Leu His Asn Gly Leu Ser His Arg Arg Phe Pro Val Gly Leu lie Ala 
245 250 255 
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Asn Val Pro Tyr Phe His Lys Leu Ala Ala Ala His Gin lie His His 
260 265 270 

Ser Gly Lys Phe Gin Giy Val Pro Phe Gly Leu Phe Leu Gly Pro Gin 

275 280 285 

Glu Leu Glu Glu Val Arg Gly Gly Thr Glu Glu Leu Glu Arg Val He 
290 295 300 



Ser Arg Thr Ala Lys Arg Thr Gin Ser Ser Thr 
305 310 315 



<210> 7 
<211> 1141 
<212> DNA 

<213> Adonis aestivalis 
<400> 7 

gggctgcagg aattcggcac gagagcaatt tcagtgttca gttcaggtta ttctttctac 60 

aagaatctct tgttggactc aaaaccaaat attctcaaac ccccatgcct gctattctct 120 

ccagttgtga tcatgtcgcc tatgagaaag aaaaagaaac atggtgatcc atgtatctgc 180 

tccgttgcag ggagaacaag gaaccttgat attcctcaaa ttgaagaaga ggaagagaat 240 

gtggaagaac taatagaaca gaccgattct gacatagtgc atataaagaa aacactaggg 300 

gggaaacaat caaaacggcc cactggctcc attgtcgcac ccgtatcttg tcttgggatc 360 

ctttcaatga ttggacctgc tgtttacttc aagttttcac ggctaatgga gggtggagat 420 

atacctgtag cagaaatggg gattacgttt gccacctttg ttgctgctgc tgttggcacg 480 

gagtttttgt cagcatgggt tcacaaagaa ctctggcacg agtctttgtg gtacattcac 540 

aagtctcacc atcggtcacg aaaaggccgc ttcgagttca atgatgtgtt tgctattatt 600 

aacgcgcttc ccgctattgc tcttatcaat tatggattct ccaatgaagg cctccttcct 660 

ggagcgtgct ttggtgtcgg tcttggaaca acagtctgtg gtatggctta catttttctt 720 

cacaatggcc tatcacaccg aaggttccca gtatggctta ttgcgaacgt cccttatttc 780 

cacaagctgg ctgcagctca ccaaatacac cactcaggaa aatttcaggg tgtaccattt 840 

ggcctgttcc ttggacccaa ggaattggaa gaagtaagag gaggcactga agagttggag 900 

agggtaatca gtcgtacaac taaacgaacg caaccatcta cctgaatcaa tttttttaca 960 

tatataaggt tttagtttat cggtgttata aaatcacaca tccgtatcgt tttagtaagt 1020 

caaagttaag atacttcctt cttagaatat tttttgatgt ataggtcgcg gatatactgt 1080 

tacactattc gttgtggaat tccattataa aaaaataaaa aaaaaaaaaa aaaaactcga 1140 
g 1141 

<210> 8 
<211> 315 
<212> PRT 

<213> Adonis aestivalis 
<400> 8 

Met Gly Leu Gin Glu Phe Gly Thr Arg Ala lie Ser Val Phe Ser Ser 
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15 10 15 

Gly Tyr Ser Phe Tyr Lys Asn Leu Leu Leu Asp Ser Lys Pro Asn lie 
20 25 30 

Leu Lys Pro Pre Cys Leu Leu Phe Ser Pro Val Val lie Met Ser Pro 
35 40 45 

Met Arg Lys Lys Lys Lys His Gly Asp Pro Cys lie Cys Ser Val Ala 
50 55 60 

Gly Arg Thr Arg Asn Leu Asp lie Pro Gin He Glu Glu Glu Glu Glu 
65 70 75 80 

Asn Val Glu Glu Leu He Glu Gin Thr Asp Ser Asp He Val His He 
85 90 95 

Lys Lys Thr Leu Gly Gly Lys Gin Ser Lys Arg Pro Thr Gly Ser He 
100 105 110 

Val Ala Pro Val Ser Cys Leu Gly He Leu Ser Met He Gly Pro Ala 
115 120 125 

Val Tyr Phe Lys Phe Ser Arg Leu Met Glu Gly Gly Asp He Pro Val 
130 135 140 

Ala Glu Met Gly He Thr Phe Ala Thr Phe Val Ala Ala Ala Val Gly 
145 150 155 160 

Thr Glu Phe Leu Ser Ala Trp Val His Lys Glu Leu Trp His Glu Ser 
165 170 175 

Leu Trp Tyr He His Lys Ser His His Arg Ser Arg Lys Gly Arg Phe 
180 185 190 

Glu Phe Asn Asp Val Phe Ala He He Asn Ala Leu Pro Ala He Ala 
195 200 205 

Leu He Asn Tyr Gly Phe Ser Asn Glu Gly Leu Leu Pro Gly Ala Cys 
210 215 220 

Phe Gly Val Gly Leu Gly Thr Thr Val Cys Gly Met Ala Tyr He Phe 
225 230 235~ 240 

Leu His Asn Gly Leu Ser His Arg Arg Phe Pro Val Trp Leu lie Ala 
245 250 255 

Asn Val Pre Tyr Phe His Lys Leu Ala Ala Ala His Gin He His His 
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265 
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270 



Ser Gly Lys Phe Gin Gly Val Pro Phe Gly Leu Phe Leu Gly Pro Lys 
275 280 285 

Glu Leu Glu Glu Val Arg Gly Gly Thr Glu Glu Leu Glu Arg Val lie 
290 295 300 

Ser Arg Thr Thr Lys Arg Thr Gin Pro Ser Thr 
305 310 315 



<210> 9 
<211> 1149 
<212> DNA 

<213> Adonis aestivalis 
<400> 9 

agcaatctCB gtgttcacta caagttattc tttccacaag aatctcttgt tgcactcaaa 60 

acaagacatt ctcaaccccz catgtttgct cttctctcca gttgtggtgg agtcgcctat 120 

gagaaagaaa aagacace:: gtgctgcatg tatctgctct gttgcagaga gaacaaggaa 180 

ccttgatatt cctcaaatzg aagaagagga agagaacgag gaagaactaa tagaacagac 240 

ggattctggc ataattcaia taaagaaaac gctagggggg aaacaatcaa gacggtccac 300 

tggctccatt gtcgcacccc tatcttgtct tgggatcctt tcaatgatcg gacctgctgt 360 

ttacttcaag ttttcacggc taatggagtg tggagatatt cctgtcgcag aaatggggat 420 

tacgtttgcc gcctttgttg ctgctgcgat tggcacggaa tttttgtcag gatgggttca 480 

caaagaactc tggcacgett ctttgtggta cattcacaag tctcaccata ggtcacgaaa 540 

aggccgcttc gagttcaat;c atgtgtttgc tattattaac gcgcttcctg ctattgctct 600 

tatcaattat ggattctcaa atgaaggcct ccttcctgga gcctgctttg gtaccggtct 6€0 

tggaacgaca gtctgtggca tggcttacat ttttcttcac aatggccttt cacaccgaag 720 

gttcccagta gggcttatic caaacgtccc ttatttccac aagctggctg cagctcacca 780 

aatccatcac tcaggaaaa* ttcagggtgt accatttggc ctgttccttg gaccccagga 840 

attggaagaa gtaagaggag gcactgaaga attggagagg gtgatcagtc gtacagctaa 900 

acgaacgcaa tcatctacc; gaatcaactc ttttacattt atgaggtttt agtttatcgg 960 

tgttacaagt cacacattig tgtcgttgta gtaattcaaa cttaccatac tcttttttag 1020 

aatttttttt tgatqtatag gtcgcggagt tacggttaca aaggccaaat ctattgttgt 1080 

ggaattccat tattaaaaa: aaaaattaga gtttgtagtt ttatctggtg atcaatatca 1140 
atatatatt 1149 

<210> 10 

<211> 310 

<212> PRT 

<213> Arabidopsis 

<400> 10 

Met Ala Ala Xaa Leu Ser Thr Ala Val Thr Phe Lys Pro Leu His Arg 
15 10 15 
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Ser Phe Ser Ser 
20 

Leu Ser Gly Phe 

35 

Tyr Val Vai Glu 
50 

Arg Pro Glu Ser 
65 

Ala Leu Arg Leu 



Thr Tyr Leu lie 
100 

Ala Val Met Ala 
115 

Glu lie Ser Met 
130 

Ala Ala Val Gly 
145 

Trp His Ala Ser 



Glu Gly Pro Phe 
180 

Pro Ala lie Gly 
195 

Pro Gly Leu Cys 
210 

Ala Tyr Met Phe 
225 

Gly Pro lie Ala 



Gin Leu His His 
260 



Ser Ser Thr Asp 



Ser Pro Ser Leu 
40 

Glu Arg Arg Gin 
55 

Thr Ser Ser Thr 
70 

Ala Glu Lys Leu 
85 

Ala Ala Met Leu 



Val Tyr Tyr Arg 
120 

Leu Glu Met Phe 
135 

Met Glu Phe Trp 
150 

Leu Trp Asn Met 



Glu x,eu Asn Asp 



Leu Leu Ser Tyr 
200 

Phe Gly Ala Gly 
215 

Vai His Asp Gly 
230 

Asp Val Pro Tyr 
245 

Thr Asp Lys Phe 



Phe Arg Leu Arg 
25 

Arg Phe Lys Arg 



Asn Ser Pro lie 
60 

Asn Ala lie Asp 
75 

Glu Arg Lys Lys 
90 

Ser Ser Phe Gly 
105 

Phe Ser Trp Gin 



Gly Thr Phe Ala 
140 

Ala Arg Trp Ala 
155 

His Glu Ser His 
170 

Val Phe Ala lie 
185 

Gly Phe Phe Asn 



Leu Gly lie Thr 
220 

Leu Val His Lys 
235 

Leu Arg Lys Val 
250 

Asn Gly Val Pro 
265 



Leu Pro Lys Ser 
30 

Phe Ser Val Cys 
45 

Glu Asn Asp Glu 



Ala Glu Tyr Leu 
80 

Ser Glu Arg Ser 
95 

He Thr Ser Met 
110 

Met Glu Gly Gly 
125 

Leu Ser Val Gly 



His Arg Ala Leu 
160 

His Lys Pro Arg 
175 

Val Asn Ala Gly 
190 

Lys Gly Leu Val 
205 

Val Phe Gly He 



Arg Phe Pro Val 
240 

Ala Ala Ala His 
255 

Tyr Gly Leu Phe 
270 
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Leu Gly Pro Lys 
275 

Lys Glu lie Ser 
290 

Gly Ser Ser Ser 
305 



Glu Leu Glu Glu 
280 

Arg Arg lie Lys 
295 

Ser Ser 
310 



Val Gly Gly Asn 



Ser Tyr Lys Lys 
300 



Glu Glu Leu Asp 
285 

Ala Ser Gly Ser 



<210> 11 
<211> 60 
<212> DNA 

<213> Adonis aestivalis 
<400> 11 

cataccataa atagtagagg acaacctaca aaccaaccac cagaaacctc caatggcagc 60 

<210> 12 
<211> 309 
<212> PRT 

<213> Adonis aestivalis 
<400> 12 

Met Ala Ala Ala lie Ser Val Phe Ser Ser Gly Tyr Ser Phe Tyr Lys 
15 10 15 

Asn Leu Leu Leu Asp Ser Lys Pro Asn lie Leu Lys Pro Pro Cys Leu 
20 25 30 

Leu Phe Ser Pro Val Val lie Met Ser Pro Met Arg Lys Lys Lys Lys 
35 40 45 

His Gly Asp Pro Cys He Cys Ser Val Ala Gly Arg Thr Arg Asn Leu 
50 55 60 

Asp He Pro Gin He Glu Glu Glu Glu Glu Asn Val Glu Glu Leu lie 
65 70 75 80 

Glu Gin Thr Asp Ser Asp He Val His He Lys Lys Thr Leu Gly Gly 
85 90 95 

Lys Gin Ser Lys Arg Pro Thr Gly Ser He Val Ala Pro Val Ser Cys 
100 105 HO 

Leu Gly He Leu Ser Met He Gly Pro Ala Val Tyr Phe Lys Phe Ser 
115 120 125 
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Arg Leu Met Glu Gly Giy Asp lie Pro Val Ala Glu Met Gly lie Thr 
130 135 140 

Phe Ala Thr Phe Val Ala Ala Ala Val Gly Thr Glu Phe Leu Ser Ala 
145 150 155 160 

Trp Val His Lys Glu Leu Trp His Glu Ser Leu Trp Tyr lie His Lys 
165 170 175 

Ser His His Arg Ser Arg Lys Gly Arg Phe Glu Phe Asn Asp Val Phe 
180 185 190 

Ala lie lie Asn Ala Leu Pro Ala lie Ala Leu lie Asn Tyr Gly Phe 
-195 200 205 

Ser Asn Glu Gly Leu Leu Pro Gly Ala Cys Phe Gly Val Gly Leu Gly 
210 215 . 220 

Thr Thr Val Cys Gly Met Ala Tyr lie Phe Leu His Asn Gly Leu Ser 
225 230 235 240 

His Arg Arg Phe Pro Val Trp Leu lie Ala Asn Val Pro Tyr Phe His 
245 250 255 

Lys Leu Ala Ala Ala His Gin lie His His Ser Gly Lys Phe Gin Gly 
260 265 270 

Val Pro Phe Gly Leu Phe Leu Gly Pro Lys Glu Leu Glu Glu Val Arg 
275 280 285 

Gly Gly Thr Glu Glu Leu Glu Arg Val lie Ser Arg Thr Thr Lys Arg 
290 295 300 

Thr Gin Pro Ser Thr 
305 



<210> 13 

<211> 310 

<212> PRT 

<213> Arabidopsis 

<400> 13 

Met Ala Ala Xaa Leu Ser Thr Ala Val Thr Phe Lys Pro Leu His Arg 
15 10 15 

Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg Leu Arg Leu Pro Lys Ser 
20 25 30 

12 
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Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys 
35 40 45 

Tyr Val Val Glu Glu Arg Arg Gin Asn Ser Pro lie Glu Asn Asp Glu 
50 55 60 

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala lie Asp Ala Glu Tyr Leu 
65 70 75 80 

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser 
85 90 95 

Thr Tyr Leu lie Ala Ala Met Leu Ser Ser Phe Gly lie Thr Ser Met 
100 105 110 

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly 
115 120 125 

Glu lie Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly 
130 135 140 

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 
145 150 155 160 

Trp His Ala Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg 
165 170 175 

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala He Val Asn Ala Gly 
180 185 190 

Pro Ala He Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val 
195 200 205 

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly He Thr Val Phe Gly He 
210 215 220 

Ala Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val 
225 230 235 240 

Gly Pro He Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His 
245 250 255 

Gin Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe 
260 265 270 

Leu Gly Pro Lys Giu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp 
275 280 285 

13 
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Lys Glu lie Ser Arg Arg lie Lys Ser Tyr Lys Lys Ala Ser Gly Ser 
290 295 300 

Gly Ser Ser Ser Ser Ser 
305 310 



<210> 14 

<211> 305 

<212> PRT 

<213> Arabidopsis 

<400> 14 

Met Ala Ala Gly Leu Ser Thr lie Ala Val Thr Leu Lys Pro Leu Asn 
15 10 15 

Arg Ser Ser Phe Ser Ala Asn His Pro lie Ser Thr Ala Val Phe Pro 

20 25 30 

Pro Ser Leu Arg Phe Asn Gly Phe Arg Arg Arg Lys lie Leu Thr Val 
35 40 45 

Cys Phe Val Val Glu Glu Arg Lys Gin Ser Ser Pro Met Asp Asp Asp 
50 55 60 

Asn Lys Pro Glu Ser Thr Thr Ser Ser Ser Glu He Leu Met Thr Ser 
65 70 75 80 

Arg Leu Leu Lys Lys Ala Glu Lys Lys Lys Ser Glu Arg Phe Thr Tyr 
85 90 95 

Leu He Ala Ala Val Met Ser Ser Phe Gly He Thr Ser Met Ala He 
100 105 110 

Met Ala Val Tyr Tyr Arg Phe Ser Trp Gin Met Lys Gly Gly Glu Val 
115 120 125 

Ser Val Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly Ala Ala 
130 135 140 

Val Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu Trp 
145 150 155 160 

His Asp Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg Glu 
165 170 175 



Gly Ala Phe Glu Leu Asn Asp Val Phe Ala lie Thr Asn Ala Val Pro 

14 
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180 185 



190 



Ala lie Gly Leu Leu Tyr Tyr Gly Phe Leu Asn Lys Gly Leu Val Pro 
195 200 205 

Gly Leu Cys Phe Gly Ala Gly Leu Gly He Thr Met Phe Gly Met Ala 
210 215 220 

Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly 
225 230 235 240 

Pro lie Ala Asn Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His Gin 
245 250 255 

Leu His His Thr Asp Lys Phe Lys Gly Val Pro Tyr Gly Leu Phe Leu 
260 265 270 

Gly Pro Lys Gin Giu Val Glu Glu Val Gly Gly Lys Glu Glu Leu Glu 
275 280 285 

Lys Glu He Ser Arg Arg He Lys Leu Tyr Asn Lys Gly Ser Ser Thr 
29 0 295 300 

Ser 
305 



<210> 15 
<211> 315 
<212> PRT 

<213> Capsicum annuum 



<400> 15 
Met Ala Ala Glu 
1 

Gin Arg . Asn Pro 
20 

Leu Leu Phe Phe 

35 

Ser Arg Arg Lys 
50 

Lys Leu Tyr Thr 
65 



He Ser He Ser 
5 

Phe Pro Ala Pro 



Ser Pro Leu Thr 
40 

Pro Arg Leu Ala 
55 

Ala Gin Ser Gly 

70 



Ala Ser Ser Arg 
10 

Lys Tyr Phe Ala 
25 

Cys Asn Leu Asp 



Ala Cys Phe Val 
60 

Lys Gin Ser Asp 

75 



Ala He Cys Leu 
15 

Thr Ala Pro Pro 
30 

Ala He Leu Arg 
45 

Leu Lys Asp Asp 



Thr Glu Ala He 
80 
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Gly Asp Glu lie Glu Val Glu Thr Asn Glu Glu Lys Ser Leu Ala Val 
85 90 95 

Arg Leu Ala Glu Lys Phe Ala Arg Lys Lys Ser Glu Arg Phe Thr Tyr 
100 105 110 

Leu Val Ala Ala Val Met Ser Ser Leu Gly He Thr Ser Met Ala Val 
115 120 125 

He Ser Val Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly Glu Met 
130 135 140 

Pro Phe Ser Glu Met Phe Cys Thr Phe Ala Leu Ala Phe Gly Ala Ala 
145 150 155 160 

He Gly Met Glu Tyr Trp Ala Arg Trp Ala His Arg Ala Leu Trp His 
165 170 175 

Ala Ser Leu Trp His Met His Glu Ser His His Arg Pro Arg Glu Gly 
180 185 190 

Pro Phe Glu Leu Asn Asp He Phe Ala He He Asn Ala Val Pro Ala 
195 200 205 

He Ala Phe Phe Ser Phe Gly Phe Asn His Lys Gly Leu He Pro Gly 
210 215 220 

He Cys Phe Gly Ala Gly Leu Gly He Thr Val Phe Gly Met Ala Tyr 
225 230 235 240 

Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly Pro 
245 250 255 

He Ala Lys Val Pro Tyr Phe Gin Arg Val Ala Ala Ala His Gin Leu 
260 265 270 

His His Ser Asp Lys Phe Asp Gly Val Pro Tyr Gly Leu Phe Leu Gly 
275 280 285 

Pro Lys Glu Leu Glu Glu Val Gly Val lie Glu Glu Leu Glu Lys Glu 
290 295 300 



Val Asn Arg Arg lie Lys Ser Leu Lys Arg Leu 
305 310 315 



<210> 16 
<211> 316 



16 
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<212> PRT 

<213> Capsicum annuum 
<400> 16 

Thr Thr Gly Arg Tyr His Tyr Gin Leu Val Trp Cys Gin lie Ser Phe 
15 10 15 

Ser Ser Thr Ser Arg Thr Ser Tyr Tyr Arg His Ser Pro Phe Leu Gly 
20 25 30 

Pro Lys Pro Thr Pro Thr Thr Pro Ser Val Tyr Pro He Thr Pro Phe 
35 40 45 

Ser Pro Asn Leu Gly Ser He Leu Arg Cys Arg Arg Arg Pro Ser Phe 
50 55 60 

Thr Val Cys Phe Val Leu Glu Asp Asp Lys Phe Lys Thr Gin Phe Glu 
65 70 75 80 

Ala Gly Glu Glu Asp He Glu Met Lys He Glu Glu Gin He Ser Ala 
85 90 95 

Thr Arg Leu Ala Glu Lys Leu Ala Arg Lys Lys Ser Glu Arg Phe Thr 
100 105 110 

Tyr Leu Val Ala Ala Val Met Ser Ser Phe Gly lie Thr Ser Met Ala 
115 120 125 

Val Met Ala Val Tyr Tyr Arg Phe Tyr Trp Gin Met Glu Gly Gly Glu 
130 135 140 

Val Pro Phe Ser Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly Ala 
145 150 155 160 

Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Lys Ala Leu Trp 
165 170 175 

His Ala Ser Leu Trp His Met His Glu Ser His His Lys Pro Arg Glu 
180 185 190 

Gly Pro Phe Glu Leu Asn Asp Val Phe Ala He He Asn Ala Val Pro 
195 200 205 

Ala lie Ala Leu Leu Asp Tyr Gly Phe Phe His Lys Gly Leu He Pro 
210 215 220 

Gly Leu Cys Phe Gly Ala Gly Leu Gly He Thr Val Phe Gly Met Ala 
225 230 235 240 

17 
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Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly 
245 250 255 

Pro Val Ala Asn Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His Ser 
260 265 270 

Leu His His Ser Glu Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe Leu 
275 280 285 

Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Leu Glu Glu Leu Glu Lys 
290 295 300 

Glu Val Asn Arg Arg Thr Arg Tyr He Lys Gly Ser 
305 310 315 



<210> 17 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 17 

cagaatcggt ctgttctatt agttcttcc 

<210> 18 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 18 

caatttgagg aatatcaagc ttccttgttc tc 
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